Structure of the X (ADRP) domain of nsp3 from feline coronavirus


The structure of the X (or ADRP) domain of a pathogenic variant of feline coronavirus (FCoV) has been determined in tetragonal and cubic crystal forms to 3.1 and 2.2 Å resolution, respectively. In the tetragonal crystal form, glycerol-3-phosphate was observed in the ADP-ribose-binding site. Both crystal forms contained large solvent channels and had a solvent content of higher than 70%. Only very weak binding of this domain to ADP-ribose was detected in vitro. However, the structure with ADP-ribose bound was determined in the cubic crystal form at 3.9 Å resolution. The structure of the FCoV X domain had the expected macro-domain fold and is the first structure of this domain from a coronavirus belonging to subgroup 1a.


1. Introduction 


Coronaviruses are positive-stranded RNA viruses that belong to the order Nidovirales (Gorbalenya et al., 2006). Their virion ranges from 80 to 120 nm in diameter and contains one molecule of RNA of approximately 30 kb (Weiss & Navas-Martin, 2005). Transcription and replication of the coronaviral RNA genome take place in the cytoplasm of the host cell at replication sites that are associated with a characteristic reticulovesicular network of modified endoplasmic reticulum (Stertz et al., 2007; Snijder et al., 2006; Prentice et al., 2004; Knoops et al., 2008). The first two open reading frames (ORF1a and ORF1b) of the viral genome overlap and encode two replicase precursor polyproteins, pp1a and pp1ab. These long polyproteins (~4000 and ~7000 amino acids, respectively) are the substrates for the autocatalytic production of 15-16 mature nonstructural proteins (nsps), which is driven by viral proteases: a 3C-like main protease (Mpro; nsp5) and one or two (depending on the virus) accessory papain-like proteases (PL1pro and PL2pro in nsp3; Weiss & Navas-Martin, 2005; Ziebuhr et al., 2000). The largest cleavage product derived from pp1a/pp1ab is nsp3. This subunit includes diverse enzymatic and poorly characterized domains that are involved in the replication and expression of the virus genome and virus-host interactions (Thiel et al., 2003; Snijder et al., 2003; Neuman et al., 2008; Kanjanahaluethai et al., 2007).

One of the ubiquitous domains present in nsp3 of all coronaviruses is the X domain [also known as the adenosine diphosphate-ribose-1″-phosphatase (ADRP) domain], which was originally identified as a domain that is conserved in vertebrate RNA viruses of the alphavirus-like supergroup and coronaviruses (Koonin et al., 1992; Gorbalenya et al., 1991). Its proposed enzymatic activity was based on its distant similarity to a cellular enzyme (Snijder et al., 2003), the activity of which was first identified in yeast proteins (Shull et al., 2005; Martzen et al., 1999; Kumaran et al., 2005) and subsequently in the Archaeoglobus fulgidus protein AF1521 (Karras et al., 2005). ADRP activity has also been shown for the severe acute respiratory syndrome virus (SARS-CoV), human coronavirus HCoV-229E and porcine transmissible gastroenteritis virus (TGEV) X domains (Putics et al., 2005, 2006). The RNA viral X domains and their cellular homologues, which can be found in eukaryotes, bacteria and archaea, belong to a large structural family prototyped by a macro domain (Pehrson & Fuji, 1998). The human macroH2A1 macro domain has been shown to bind O-acetyl-ADP-ribose (Kustatscher et al., 2005) and both the yeast protein and the A. fulgidus protein AF1521 can bind ADP-ribose (Karras et al., 2005; Kumaran et al., 2005). It is believed that this activity or the binding of related ligands [for example poly(ADP)-ribose] might determine the function of the macro domain. The X domains of several coronaviruses [SARS-CoV, HCoV-229E and infectious bronchitis virus (IBV) strain M41; Egloff et al., 2006; Xu et al., 2009] and alphaviruses (Malet et al., 2009) have also been shown to bind ADP-ribose. Catalytic phosphatase activity and ADP-ribose binding are two related but distinct properties of the X domain. Despite considerable progress in the in vitro characterization of coronaviral X domains, their activity and function in vivo, which are apparently linked to virus pathogenesis, remain poorly understood (Eriksson et al., 2008).

Coronaviruses may be divided into several genetic subgroups, the first of which were originally established using serological cross-reactivity (Gorbalenya et al., 2004; Lai & Holmes, 2001). Subgroup 1a includes feline coronavirus (FCoV) and TGEV. HCoV-229E belongs to subgroup 1b, together with HCoV-NL63 and porcine epidemic diarrhoea virus (PEDV). Subgroup 2a contains the murine hepatitis virus MHV and the porcine haemagglutinating encephalomyelitis virus BCoV. In 2003 the new SARS-CoV was classified as a member of a new subgroup 2b. Group 3 originally included only avian viruses such as IBV. Very recently, a number of new coronaviruses have been identified that could be prototypes of separate subgroups in groups 2 and 3 (Woo et al., 2009; Mihindukulasuriya et al., 2008).

The X domains of nsp3 from coronaviral subgroups 1b (HCoV-229E; Piotrowski et al., 2009; Xu et al., 2009), 2b (SARS-CoV; Egloff et al., 2006; Saikatendu et al., 2005) and 3 (IBV; Piotrowski et al., 2009; Xu et al., 2009) have been structurally characterized. Here, we report the first crystal structure of the X domain from feline infectious peritonitis virus (FIPV), which belongs to coronaviral subgroup 1a. FIPV is a pathogenic FCoV variant that emerged by mutation of the relatively benign enteric FCoV (Vennema et al., 1998; Poland et al., 1996) and causes a fatal immune-mediated disease in cats (Pedersen, 1995). Because the variations are minor, we will henceforth use FCoV as an abbreviation for the virus, although we are working with a FIPV construct. The FCoV X-domain structure was determined in tetragonal and cubic space groups to 3.1 and 2.2 Å resolution, respectively. In addition, the structure with bound ADP-ribose was determined in the cubic crystal form to 3.9 Å resolution. We analyzed the similarity of the FCoV X-domain structure to those of other coronaviral X domains and macro domains from other organisms.


2. Materials and methods 
2.1. Cloning 


Vector pDEST14 (Invitrogen) containing the X domain of nsp3 (residues 1254-1421 of the polyprotein pp1a) from feline coronavirus (FCoV; strain FIPV WSU-79/1146; GenBank/RefSeq accession No. NC_007025.1) was originally created using Gateway cloning technology (with a C-terminal His6 tag). The X domain was recloned into the pETM-11 vector (EMBL Hamburg), which allows the removal of an N-terminal His6 tag with tobacco etch virus (TEV) 3C-like protease. The point mutant Leu122Met was created using the Stratagene QuikChange mutagenesis kit. The nucleotide substitution was confirmed by DNA sequencing (MWG Biotech).


2.2. Expression and purification 


Protein expression was performed in Escherichia coli strain Rosetta (DE3) pLysS (Novagen). Cultures were grown in LB medium at 310 K until the OD600 reached 0.8, induced with 1 mM isopropyl β-D-1-thiogalactopyranoside (IPTG) and left shaking overnight at 288 K. Cells were collected by centrifugation at 4000g (30 min, 277 K) and frozen at 253 K.

The bacterial pellet from 1 l cell culture was resuspended in 20 ml lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 5%(v/v) glycerol]. Cells were lysed by sonication and centrifuged at 38 000g in SS-34 centrifuge tubes (50 min, 277 K). The supernatant was filtered through a 0.45 µm pore-size membrane (Sartorius Stedim Biotech), loaded onto 5 ml Ni-NTA beads (Qiagen) pre-equilibrated with lysis buffer and incubated for 30 min at 277 K. The beads were then washed with high-salt buffer (50 mM Tris pH 8.0, 500 mM NaCl) and the protein was eluted with 50 mM Tris pH 8.0, 150 mM NaCl and 500 mM imidazole. The fractions collected were analyzed by SDS-PAGE and the protein sample was buffer-exchanged to lysis buffer using PD-10 columns (GE Healthcare).

The protein obtained using the pETM-11 vector (wild type/Leu122Met) was incubated overnight with His-tagged TEV protease (EMBL Hamburg) at 277 K (in a 50:1 ratio). The sample was then loaded onto Ni-NTA beads equilibrated with lysis buffer. The flowthrough was collected and the beads were washed with 20 ml lysis buffer. The fractions collected were analyzed by SDS-PAGE.

The protein sample was loaded onto a Superdex 75 (16/60) gel-filtration column (GE Healthcare) equilibrated with lysis buffer. The elution volume of the protein corresponded to a monomer according to the column calibration (Gel Filtration Standards, Bio-Rad). The purity of the sample was checked by SDS-PAGE. The protein solution was concentrated to 11 mg ml⁻¹ using a 10 kDa molecular-weight cutoff centrifuge concentrator (Vivaspin, Vivascience). The protein concentration was determined using a NanoDrop spectrophotometer (Thermo Scientific). The protein sample was stored at 277 K (for up to one week) or flash-frozen in the presence of 20%(v/v) glycerol and kept at 193 K (for up to six months).


Table 1
Data-collection and refinement statistics.

                                        Wild type                Leu122Met mutant     Wild type + ADP-ribose
Data collection
  Crystallization condition             Ammonium sulfate         Ammonium citrate     Ammonium citrate
  X-ray source                          ESRF ID29                ESRF ID14-2          EMBL X13
  Space group                           P4₁2₁2                   P2₁3                 P2₁3
  Unit-cell parameters (Å)              a = b = 161.1, c = 98.3  a = b = c = 218.9    a = b = c = 220.2
  Wavelength (Å)                        0.979                    0.993                0.812
  Resolution range (Å)                  50.0-3.1                 50.0-2.2             [illegible]
  Mosaicity (°)                         0.17                     0.14                 [illegible]
  Mean I/σ(I)                           19.2 (3.5)               20.3 (5.6)           [illegible]
  Rfac (linear) (%)                     7.1 (46.0)               15.4 (60.7)          [illegible]
  Redundancy                            4.0                      21.5                 [illegible]
  Rmeas (%)                             8.0 (51.6)               15.8 (62.1)          [illegible]
  No. of observations                   94994                    3791311              [illegible]
  No. of unique reflections             23603                    176075               32246
  Completeness (%)                      99.1 (98.1)              99.8 (98.9)          98.9 (98.5)
Refinement
  Resolution range (Å)                  24.7-3.1                 24.8-2.2             [illegible]
  No. of reflections (working/free)     22625/1225               167093/8797          [illegible]
  No. of protein residues               A, 168; B, 166; C, 168   A-H, 168             A-H, 168
  No. of waters                         79                       2343                 [illegible]
  No. of ADP-ribose molecules           -                        -                    [illegible]
  No. of G3P molecules                  3                        -                    [illegible]
  No. of sulfate molecules              6                        -                    [illegible]
  No. of chloride ions                  4                        -                    [illegible]
  No. of sodium ions                    -                        -                    [illegible]
  Rwork/Rfree (%)                       20.01/24.20              14.70/18.13          19.88/23.91
  Average B (Å²)                        60.0                     39.9                 31.0
  Geometry bonds (Å)/angles (°)         0.014/1.6                0.020/1.6            [illegible]
  Diffraction data precision
    indicator (Å)                       0.31                     0.19                 [illegible]


2.3. Crystallization


Initial crystallization trials were carried out at 292 K using the sitting-drop vapour-diffusion method in 96-well Greiner plates at the EMBL Hamburg High-throughput Crystallization Facility (Mueller-Dieckmann, 2006). Crystals were obtained from 2.4 M ammonium sulfate, 0.1 M MES pH 6.0 (for protein expressed from pDEST14 and pETM-11) and 1.8 M triammonium citrate pH 7.0 (for protein expressed from pETM-11 vector). Further optimization of the crystallization conditions was performed manually in 24-well plates (Qiagen) using the hanging-drop vapour-diffusion method at 292 K. Crystals were obtained from a 2 µl drop of 6 mg ml⁻¹ protein in 2.6-2.8 M ammonium sulfate and 0.1 M MES pH 5.0-6.0 or 11 mg ml⁻¹ protein in 1.4-1.6 M diammonium citrate pH 6.0-7.0.

The crystals obtained using the ammonium sulfate condition initially diffracted to 4.5 Å resolution. In order to improve their diffraction quality, a dehydration method was implemented (Heras & Martin, 2005). Crystallization drops containing crystals were equilibrated for 12 h at 292 K in reservoirs containing crystallization-condition components plus increasing concentrations of one of the following: glycerol [4.25-17%(v/v)], MPD [5-20%(v/v)], ethylene glycol [5-20%(v/v)] or PEG 400 [2.5-10%(v/v)]. In each case after 12 h a few crystals were flash-cooled in liquid nitrogen. Crystals incubated in the 4.25% glycerol condition diffracted to 3.1 Å resolution.


2.4. Data collection and processing


Crystals grown with ammonium sulfate (from protein expressed from pDEST14 vector) were dehydrated using 4.25% glycerol and flash-cooled in liquid nitrogen. Single-wavelength X-ray diffraction data were collected from single crystals at 100 K on European Synchrotron Radiation Facility (ESRF) beamline ID29 using an ADSC Quantum 315R detector. The crystal-to-detector distance was maintained at 425.1 mm with an oscillation range of 0.6°. 100 images were collected to a maximum resolution of 3.0 Å.

Crystals grown in ammonium citrate (Leu122Met protein) were cryoprotected in 4.5 M sodium formate. A single-wavelength data set containing 360 images was collected from single crystals at 100 K on ESRF beamline ID14-2 using an ADSC Quantum 4 CCD detector. The crystal-to-detector distance was kept at 203 mm with an oscillation range of 0.5°. The crystal diffracted to 2.2 Å resolution.

A crystal grown in ammonium citrate (wild-type protein) was transferred to a 1 µl drop containing 4.5 M sodium formate and 2 mM ADP-ribose. After 1.5 h incubation the crystal was flash-cooled in liquid nitrogen. A single-wavelength data set containing 64 images was collected from a single crystal fragment at 100 K on the EMBL Hamburg X13 beamline using a MAR CCD 165 mm detector. The crystal-to-detector distance was kept at 300 mm with an oscillation range of 1°. The crystal diffracted to 3.9 Å resolution.
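
The total rotation covered by each data set follows directly from the number of images and the oscillation range per image quoted above. The short Python sketch below only illustrates this arithmetic; the labels and the function name are ours and are not part of any data-processing program.

```python
# (label, number of images, oscillation range per image in degrees),
# taken from the data-collection description in the text.
DATA_SETS = [
    ("tetragonal, ESRF ID29", 100, 0.6),
    ("cubic Leu122Met, ESRF ID14-2", 360, 0.5),
    ("cubic + ADP-ribose, EMBL X13", 64, 1.0),
]


def total_rotation(n_images: int, oscillation_deg: float) -> float:
    """Total crystal rotation (degrees) covered by a data set."""
    return n_images * oscillation_deg


for label, n_images, osc in DATA_SETS:
    print(f"{label}: {total_rotation(n_images, osc):.1f} degrees")
```

The much larger rotation range collected for the cubic Leu122Met crystal is in line with the higher redundancy reported for that data set.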

The recorded images were processed with XDS (Kabsch, 
1988) and the reflection intensities were processed with 
COMBAT and scaled with SCALA (Evans, 1993) from the 
CCP4 program suite (Collaborative Computational Project, 
Number 4, 1994). Data-collection statistics are shown in 
Table 1. 
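
As a simple consistency check on Table 1, the redundancy should equal the number of observations divided by the number of unique reflections. The following minimal Python sketch applies this to the two data sets for which both counts are legible; the dictionary layout and function name are illustrative only.

```python
# Counts and reported redundancies as listed in Table 1.
DATASETS = {
    "wild type (tetragonal)": {"observations": 94994, "unique": 23603, "reported": 4.0},
    "Leu122Met (cubic)": {"observations": 3791311, "unique": 176075, "reported": 21.5},
}


def redundancy(observations: int, unique: int) -> float:
    """Average number of measurements per unique reflection."""
    return observations / unique


for name, d in DATASETS.items():
    value = redundancy(d["observations"], d["unique"])
    print(f"{name}: computed {value:.2f}, reported {d['reported']}")
```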


2.5. Structure determination 


The structure of the FCoV X domain in space group P4₁2₁2 was determined by the molecular-replacement method using the program MOLREP (Vagin & Teplyakov, 1997). The coordinates of the HCoV-NL63 X domain (Piotrowski et al., to be published) served as a search model. The solution showed three molecules in the asymmetric unit. Refinement was carried out using the program REFMAC5 (Murshudov et al., 1997). The structure was visualized and rebuilt into the electron density using the program Coot (Emsley & Cowtan, 2004). The stereochemistry of the model was evaluated using the program MolProbity (Davis et al., 2007).

Molecule A from the tetragonal space-group solution was used as a search model for molecular replacement in space group P2₁3. The solution found by the program Phaser (McCoy et al., 2007) consisted of six molecules. Appropriately weighted 2Fo - Fc and Fo - Fc maps were calculated at this stage. These maps showed additional electron density indicating the presence of two additional protein molecules in the asymmetric unit. The molecular-replacement solution was used as a preliminary model for ARP/wARP (Perrakis et al., 1999), which built most of the eight molecules into the electron density. The final refinement was carried out with REFMAC5. Where necessary, the structure was manually modified using the program Coot. The stereochemistry of the model was evaluated with the program MolProbity.

The structure of the ADP-ribose-bound FCoV X domain in space group P2₁3 was solved using the molecular-replacement protocol of the automated crystal structure-determination platform Auto-Rickshaw (Panjikar et al., 2005). The structure solution from the cubic space group was used as a search model for molecular replacement. Appropriately weighted 2Fo - Fc and Fo - Fc maps were calculated for the coordinate file obtained from Auto-Rickshaw. These maps showed additional electron density in the binding pockets of three molecules (B, C and D), indicating the presence of ADP-ribose. The refinement was carried out using the program REFMAC5. The structure was visualized and the fit to the electron density was verified using the program Coot. The stereochemistry of the model was evaluated with the program MolProbity.

Atomic coordinates have been deposited in the Protein Data Bank (the PDB code for the tetragonal space group is 3ew5, that for the cubic space group is 3eti and that for the ADP-ribose-bound structure is 3jzt).

Interfaces between molecules in both crystal forms were analyzed with the PISA server (Krissinel & Henrick, 2007). Interface areas larger than 300 Å² were treated as being significant in describing the crystal packing, although the domain was clearly monomeric in dilute solution. Interactions between molecules were calculated with the CCP4 program CONTACT. The maximum contact distance considered was 3.6 Å.


2.6. Binding assay 


The ADP-ribose-binding assay is based upon the pull-down experiment described by Karras et al. (2005). The Ni-NTA slurry (~100 µl) was equilibrated with lysis buffer [50 mM Tris pH 8.0, 150 mM NaCl, 5%(v/v) glycerol]. 1 ml 27 µM protein sample in lysis buffer (from protein expressed using the pETM-11 vector) was loaded onto the column and incubated for 10 min. In order to check the effectiveness of the immobilization of the protein on the resin, the absorbance of the collected supernatant was measured at 280 nm. 1 ml 27 µM ADP-ribose solution was loaded onto the column and incubated for 30 min. The absorbance of the collected supernatant was measured at 259 nm. The percentage of ADP-ribose bound to the protein was calculated as the ratio between the supernatant absorbance and the absorbance of 27 µM ADP-ribose solution at 259 nm. The ADP-ribose concentration was calculated using an extinction coefficient of 15 400 M⁻¹ cm⁻¹ at 260 nm.
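
The calculation behind this assay can be sketched in a few lines of Python. The sketch rests on two assumptions that are not stated in the text: a 1 cm optical path length, and that the retained fraction is taken as the complement of the supernatant-to-reference absorbance ratio. The supernatant reading used here is a hypothetical value chosen for illustration, not a measurement.

```python
EPSILON_260 = 15400.0  # M^-1 cm^-1, extinction coefficient quoted in the text
PATH_LENGTH_CM = 1.0   # assumed path length; not stated in the text


def absorbance(concentration_molar, epsilon=EPSILON_260, path_cm=PATH_LENGTH_CM):
    """Beer-Lambert law: A = epsilon * c * l."""
    return epsilon * concentration_molar * path_cm


def retained_fraction(a_supernatant, a_reference):
    """Fraction of ligand retained on the resin, assumed to be
    1 - A(supernatant) / A(reference solution)."""
    if a_reference <= 0:
        raise ValueError("reference absorbance must be positive")
    return 1.0 - a_supernatant / a_reference


a_ref = absorbance(27e-6)  # 27 uM ADP-ribose reference solution
a_sup = 0.386              # hypothetical supernatant reading
print(f"reference absorbance: {a_ref:.4f}")
print(f"retained: {100 * retained_fraction(a_sup, a_ref):.1f}%")
```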


3. Results and discussion 
3.1. Structure determination 


The wild-type FCoV X domain (residues 1254-1421 of pp1a, here renumbered as 34-201 in both the pDEST14 and pETM-11 vectors) cloned into vector pDEST14 was expressed in E. coli cells at high yield (~60 mg per litre of cell culture). The protein crystallized overnight from a solution containing ammonium sulfate and MES. The crystals belonged to space group P4₁2₁2, with unit-cell parameters a = b = 161.1, c = 98.3 Å. In this crystal form there were three molecules (chains A, B and C) in the asymmetric unit, corresponding to a solvent content of 78% (Matthews, 1968; see Supplementary Fig. S6¹). The structure was refined at 3.1 Å resolution to a final R value of 20.0% (Rfree = 24.2%). All residues, except for the C-terminal His6 tag, two N-terminal residues in chains A and C and four in chain B, could be placed into the electron density. The Ramachandran plot showed 91.7% of the residues in preferred regions and 8.3% in allowed regions. The refined structure contained three glycerol-3-phosphate molecules, six sulfate ions, four chloride ions and 79 solvent molecules.

The mutated Leu122Met X domain crystallized in the presence of ammonium citrate. These crystals belonged to space group P2₁3, with unit-cell parameters a = b = c = 218.9 Å. There were eight molecules (chains A-H) in the asymmetric unit, corresponding to 74% solvent content. The crystals contained large solvent channels with a diameter of approximately 80 Å (see Supplementary Fig. S1¹). The structure was refined at 2.2 Å resolution to a final R value of 14.7% (Rfree = 18.1%). In all chains only the first three N-terminal residues showed no electron density. The Ramachandran plot showed 97.8% of the residues to be in preferred regions and 2.2% to be in allowed regions. The refined structure contained 2343 solvent molecules.
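
Solvent contents of this kind are usually estimated from the Matthews coefficient, VM = V / (Z x M), with the solvent fraction approximated as 1 - 1.23/VM. The Python sketch below reproduces the estimate for the tetragonal form under stated assumptions: a molecular mass of 19 kDa per chain (our rough guess for about 168 residues plus tag; it is not given in the text), eight symmetry operators for the tetragonal space group, and the conventional factor of 1.23 Å³ Da⁻¹. The result is sensitive to the assumed molecular mass, so it should be read as an order-of-magnitude check only.

```python
PROTEIN_VOLUME_FACTOR = 1.23  # A^3 per Da, conventional value used with VM


def matthews(cell_volume_a3, n_symops, n_mol_asu, mw_da):
    """Return (Matthews coefficient in A^3/Da, estimated solvent fraction)."""
    vm = cell_volume_a3 / (n_symops * n_mol_asu * mw_da)
    solvent = 1.0 - PROTEIN_VOLUME_FACTOR / vm
    return vm, solvent


# Tetragonal form: a = b = 161.1 A, c = 98.3 A, three molecules per asymmetric unit.
a, c = 161.1, 98.3
ASSUMED_MW_DA = 19000.0  # assumption, not taken from the text
vm, solvent = matthews(a * a * c, n_symops=8, n_mol_asu=3, mw_da=ASSUMED_MW_DA)
print(f"VM = {vm:.2f} A^3/Da, solvent content = {100 * solvent:.0f}%")
```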

The structure with ADP-ribose bound was determined in the cubic crystal form at 3.9 Å resolution. The crystal belonged to space group P2₁3, with unit-cell parameters a = b = c = 220.2 Å. The structure was refined to a final R value of 19.9% (Rfree = 23.9%). Data-collection and structure-refinement statistics are shown in Table 1.


3.2. Overall structure 


The FCoV X domain is a single domain with a mixed α/β structure (Fig. 1a). The core of the structure is a single mixed β-sheet. The order of the strands in the sheet is β1, β2, β7, β6, β3, β5, with the last strand in two stretches, β4a and β4b. The central five strands are parallel. The β-sheet is sandwiched between six helices, with α1, α2 and α3 packing onto one face, and α4, α5 and α6 onto the other. The SARS-CoV and HCoV-229E X domains have the same arrangement of β-strands, while the IBV X domain lacks the N-terminal strand. The unique feature of the FCoV X domain β-sheet is the last broken strand. The topology is similar to that of other coronaviral X domains (Fig. 1b).

¹ Supplementary material has been deposited in the IUCr electronic archive (Reference: DZ5170). Services for accessing this material are described at the back of the journal.

Chain A from the cubic crystal form was compared with all structures in the PDB using the DALI server (http://ekhidna.biocenter.helsinki.fi/dali_server/). The structure of the FCoV X domain is very similar to those of X domains from other members of the Coronaviridae (Z scores from 20.1 to 27.6), as well as to the mammalian nonhistone domain of the histone variant macroH2A (Z scores between 19.7 and 20.0), confirming that the FCoV X domain has a macro-domain-like fold.

The r.m.s.d. between the Cα atoms of the molecules in the tetragonal space group was less than 0.4 Å (for 136 Cα atoms). In the cubic crystal form the r.m.s.d. between molecules was less than 0.15 Å (for 142 Cα atoms). The r.m.s.d. between Cα atoms from the tetragonal and cubic space groups was between 0.47 and 0.72 Å. The smaller r.m.s.d. differences in the cubic crystal form are most likely to be a result of the higher resolution structure determination (2.2 Å).
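
For reference, the r.m.s.d. over paired Cα atoms is the square root of the mean squared distance between equivalent atoms. The minimal Python sketch below assumes that the two coordinate sets have already been superposed and paired (the least-squares superposition step is omitted); the coordinates are hypothetical and serve only to exercise the function.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long lists of
    (x, y, z) coordinates that are already superposed and paired."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical C-alpha positions (in A) for two superposed chains.
chain_1 = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0)]
chain_2 = [(0.1, 0.0, 0.0), (3.8, 0.2, 0.0), (7.6, 0.0, 0.3)]
print(f"r.m.s.d. = {rmsd(chain_1, chain_2):.2f} A")
```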


3.3. Crystal packing 


The FCoV X domain has been crystallized in three different space groups: tetragonal, cubic and orthorhombic (diffraction to a maximum of 3 Å, data not shown). Three molecules were found in the asymmetric unit of the tetragonal crystal form and eight in that of the cubic crystal form. Orthorhombic crystals were obtained in buffer containing sodium formate and sodium acetate. Interestingly, soaking cubic crystals with heavy-atom solutions led to a change of the space group to the orthorhombic form. The implied similarity between the packing in the two crystal forms would suggest the presence of 24 molecules in the asymmetric unit of the orthorhombic crystal form. The tendency of the FCoV X domain to oligomerize is striking, taking into consideration the fact that no buffer molecules were interpretable in the electron density of either the tetragonal or cubic space groups. Interestingly, PISA-mediated analysis of X-domain interfaces in both the tetragonal and cubic space groups revealed no specific interactions that could result in the formation of stable quaternary structures, which is in agreement with the observation that the X domain is monomeric in dilute solution. Analytical size-exclusion chromatography demonstrated that under the experimental conditions used the protein eluted as a monomer (results not shown). However, solutions of the protein do yield a heavy precipitate overnight. The observed tendency to form multimers might be biologically relevant since the X domain is one of many domains of nsp3, which is likely to be part of a large replication/transcription complex. Indeed, interactions between several nonstructural proteins have been identified and are of importance in the structure and function of the viral enzyme complexes that have been analyzed (Imbert et al., 2008; von Brunn et al., 2007). The crystal structure of the IBV X domain contains a crystallographic dimer with a relatively large interface area of 2600 Å² (Xu et al., 2009). In contrast, the interface between FCoV X-domain monomers is small but well defined, with some interactions occurring in both crystal forms. The interactions result in rigid lattices with large solvent channels, which might make the crystals capable of acting as a host lattice for other biological molecules. The packing is described in more detail in the supplementary material.

Figure 1
(a) Ribbon representation of the FCoV X domain. Loops are shown in blue, α-helices in purple and β-strands in yellow. (b) Topology diagram of the FCoV X domain coloured as in (a).


3.4. ADP-ribose-binding site


By soaking a native FCoV X-domain crystal in 2 mM ADP-ribose for 1.5 h, we were able to observe bound ADP-ribose in the cubic crystal form. ADP-ribose molecules could be clearly identified from the electron density in molecules B, C and D. In all three molecules the ADP-ribose molecule is located in a binding pocket formed mainly by the N-terminal part of α1, the C-terminal part of β3, the long loop Lβ3-α2, the N-terminal residues of α2, the loop Lβ7-α5 and the region between the C-terminus of β8 and the N-terminus of α7 (Fig. 2). The ADP-ribose-binding cavity is open and solvent-accessible, with a positively charged floor. Upon binding to the FCoV X domain the ADP-ribose molecule adopts a slightly bent conformation. The adenine moiety lies in the hydrophobic cavity formed by Leu52, Val78, Ala81, Pro155, Phe185 and Tyr187. The side chain of Tyr187 stacks against the adenine ring. This type of interaction has also been observed in the structures of HCoV-229E (Tyr152) and in the AF1521 protein (Tyr176), but the residue is not conserved in macro domains. It is Asn in SARS-CoV, Leu in IBV and Phe in human macroH2A (Fig. 3). Interestingly, the highly conserved Asp51 that has been postulated to be critical for binding specificity does not interact with the adenine (at a distance of 4.7 Å) in this structure. The adenosine ribose forms strong hydrogen bonds to Glu191. The distal ribose interacts with the main-chain atoms of Arg73, Val75 and Gly76 and also forms a hydrogen bond to the side chain of Asn69. Two phosphate groups are hydrogen bonded to the main-chain N atoms of the region 158-SVGIF-162. During the refinement process for the tetragonal crystal form, additional electron density (wFo - DFc synthesis) was found in the binding cleft of molecules A, B and C. Glycerol-3-phosphate (G3P), which is a very abundant metabolite in bacterial cells, could be fitted to this density (Fig. 4). A comparison with the ADP-ribose-bound molecule B revealed that the phosphate of G3P is in the same position as the β-phosphate of ADP-ribose. Similarly to the ADP-ribose-bound structure, the region 158-SVGIF-162 forms hydrogen bonds between its backbone amide groups and the O atoms of the G3P phosphate. Interestingly, despite high concentrations of sulfate ion in the crystallization buffer, the binding site of the FCoV X domain is apparently selective for phosphate.

Figure 2
(a) Ribbon representation of the FCoV X domain with ADP-ribose bound. ADP-ribose is shown as a space-filling model coloured by atom type. (b) Interactions between residues of the FCoV X domain and ADP-ribose. Protein residues (in light blue) and ADP-ribose are shown in stick representation. For protein regions 73-78 and 159-162 only the main chain is shown. Hydrogen bonds are shown as dashed lines. N atoms are shown in blue, C atoms in green, O atoms in red and P atoms in magenta.

Figure 3
Structure-based sequence alignment of macro domains. The alignment is based on the PDB structures of the FCoV X domain (PDB code 3eti; UniProtKB reference Q98VG9), the HCoV-229E X domain (3ejg; UniProtKB reference P0C6X1), the SARS-CoV X domain (2fav; UniProtKB reference P0C6U8), the IBV X domain (3ewo; UniProtKB reference P0C6V5), A. fulgidus AF1521 protein (1hjz; UniProtKB reference O28751), E. coli ER58 protein (1spv; UniProtKB reference P0A8D6) and human macroH2A domain (1zr3; UniProtKB reference O75367). The secondary structure of the FCoV X domain is shown. Regions of high sequence identity are shaded blue. Residues forming the ADP-ribose-binding site are indicated by dashed magenta lines. The alignment was generated using the EBI SSM tool (http://www.ebi.ac.uk/msd-srv/ssm/) and modified with Jalview.

Figure 4
Glycerol-3-phosphate in the binding site. The O atoms of G3P phosphate are hydrogen bonded to the main-chain N atoms of the region 158-SVGIF-162. The surface of the protein is coloured according to electrostatic potential (negative charge, red; neutral, white; positive charge, blue); residues 158-162 are shown in purple, the O atoms of G3P are shown in red, the phosphate in pink and the C atoms in yellow.

Figure 5
Surface of the FCoV X domain in the ligand-free conformation (a) and with ADP-ribose bound (b). Residues Ile161 and Tyr187, which undergo significant conformational changes during ADP-ribose binding, are labelled in blue. ADP-ribose is shown as red sticks.

Comparison of ligand-free molecules with the ADP-ribose- 
bound molecules reveals two major conformational changes 
that occur upon ligand binding. Egloff et al. (2006) showed 
that the X domain undergoes conformational changes upon 
binding to ADP-ribose, leading to the formation of a ‘bridge’ 
over the binding cleft. This ‘bridge’ is formed by close contacts 
between Ile132 and Gly48 (SARS-CoV numbering). Gly48 is 
part of a largely conserved ‘triple-glycine’ sequence 47-GGG- 
49 (FCoV numbering). In the X domains of HCoV-229E and 
IBV strain M41, in which all three positions are occupied by 
glycine residues, the ‘bridge’ between Ile126—Gly44 (HCoV- 
229E numbering) and Ile133-Gly49 (IBV strain M41 
numbering) is formed upon ADP-ribose binding (Xu et al., 
2009). However, in the case of IBV strain Beaudette the 
second glycine is substituted by serine (46-GSG-48). This 
leads to large structural changes in the region 128-SCGIF-132 
(IBV strain Beaudette numbering), which might explain why 
this X domain does not bind ADP-ribose (Piotrowski et al., 
2009). In the FCoV X domain the first glycine is replaced by 
valine (75-VGG-77). In the ligand-free cubic crystal form the 
shortest interatomic distance between residues Ile161 and 
Gly76 is greater than 6.0 Å (with one exception). In the 
tetragonal crystal form with G3P bound the shortest inter- 
atomic distance between residues Ile161 and Gly76 is 4.5 Å (in 
molecules A and C) or 5.7 Å (molecule B). However, in ADP- 
ribose-bound molecules B, C and D the shortest distance 
between those two residues is smaller than 4 Å, which results 
in formation of the ‘bridge’ (Fig. 5). Interestingly, the G3P- 
bound form seems to be an intermediate between the open and 
closed conformations: residues Ile161 and Gly76 are closer 
to each other than in the ligand-free form, but the ‘bridge’ 
has not yet formed. 
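
The ‘bridge’ criterion used above (shortest interatomic distance 
between two residues falling below about 4 Å) can be illustrated with 
a minimal Python sketch. The function names, the 4 Å default cutoff 
and the coordinates are assumptions made purely for illustration; 
they are not taken from the deposited structures or from the software 
used in this study. 

```python
import math
from itertools import product


def shortest_interatomic_distance(res_a, res_b):
    """Minimum distance (in Å) between any atom of res_a and any atom of res_b.

    Each residue is a dict mapping an atom name to its (x, y, z) coordinates.
    """
    return min(
        math.dist(xyz_a, xyz_b)
        for xyz_a, xyz_b in product(res_a.values(), res_b.values())
    )


def bridge_formed(res_a, res_b, cutoff=4.0):
    """True if the two residues approach closer than the cutoff.

    The 4 Å default mirrors the threshold quoted in the text; it is an
    illustrative assumption, not a universal definition of a contact.
    """
    return shortest_interatomic_distance(res_a, res_b) < cutoff


# Made-up coordinates (Å) for illustration only.
ile_example = {"CB": (0.0, 0.0, 0.0), "CD1": (1.5, 0.0, 0.0)}
gly_example = {"CA": (5.0, 0.0, 0.0), "N": (6.0, 1.0, 0.0)}

distance = shortest_interatomic_distance(ile_example, gly_example)
print(f"shortest distance: {distance:.2f} Å, bridge: "
      f"{bridge_formed(ile_example, gly_example)}")
```

In practice the atom coordinates would be read from the refined model 
for each molecule in the asymmetric unit, and the comparison repeated 
for the ligand-free, G3P-bound and ADP-ribose-bound forms. 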

In the ligand-free FCoV X domain the size of the binding 
pocket is too small to accept ADP-ribose. This largely arises 
from the conformation of Tyr187, which 
lies across the binding pocket. In ADP- 
ribose-bound molecules the side chain 
of Tyr187 rotates by 90° and orients 
along the ADP-ribose binding site, 
significantly expanding the binding 
cavity (Fig. 5). Cleft analysis using the 
PDBsum server (http://www.ebi.ac.uk) 
showed that the conformational change of 
Tyr187 leads to the enlargement of the 
binding cleft by almost 600 Å³, which 
creates sufficient space for the binding 
of ADP-ribose. 

In order to further examine the 
binding properties of the FCoV X 
domain, an ADP-ribose-binding assay 
was performed in vitro (see §2). The 
retention of the ligand on immobilized 
X domain was 7.1% (compared with 0.8% retention in the 
case of the control), showing that the FCoV X domain binds 
ADP-ribose very weakly under the conditions used in the 
experiment. The determined upper limit for the dissociation 
constant was around 400 µM. This value is one order of 
magnitude higher than those obtained for orthologues from 
SARS-CoV (24 µM; Egloff et al., 2006) and HCoV-229E 
(28.9 µM; Piotrowski et al., 2009). The binding affinity of the 
FCoV X domain is also markedly lower than that of AF1521 
(126 nM; Karras et al., 2005) and human macroH2A1.1 
(2.7 µM for O-acetyl-ADP-ribose; Kustatscher et al., 2005). 
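
The comparison of dissociation constants above can be made explicit 
with a short Python sketch that expresses each literature value as a 
fold difference relative to the FCoV X domain. The dictionary keys and 
function names are hypothetical, and the FCoV value is the quoted 
estimate of about 400 µM rather than a fitted constant, so the ratios 
are indicative only. 

```python
import math

# Dissociation constants in µM, as quoted in the text above.
KD_UM = {
    "FCoV X domain": 400.0,       # approximate limit reported in this study
    "SARS-CoV X domain": 24.0,
    "HCoV-229E X domain": 28.9,
    "AF1521": 0.126,              # 126 nM
    "macroH2A1.1": 2.7,           # measured for O-acetyl-ADP-ribose
}


def fold_weaker(reference, other, table=KD_UM):
    """How many times larger the reference Kd is than the other Kd."""
    return table[reference] / table[other]


def orders_of_magnitude(reference, other, table=KD_UM):
    """Base-10 logarithm of the fold difference in Kd."""
    return math.log10(fold_weaker(reference, other, table))


for name in KD_UM:
    if name != "FCoV X domain":
        ratio = fold_weaker("FCoV X domain", name)
        print(f"{name}: Kd ratio {ratio:.1f} "
              f"(10^{math.log10(ratio):.2f})")
```

A larger dissociation constant corresponds to weaker binding, so a 
ratio above 1 means that the FCoV X domain binds more weakly than the 
protein it is compared with. 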

Macro domains have two related but distinct properties: 
ligand binding and enzymatic activity. Two highly divergent 
macro-domain folds have recently been identified in the nsp3 
‘SARS-unique domain’ (SUD; Tan et al., 2009) that was shown 
to bind single-stranded poly(A) (Chatterjee et al., 2009) and 
oligo(G) strings (Tan et al., 2007). Several macro domains, 
including coronaviral X domains, have been shown to bind 
ADP-ribose and related ligands. It has also been suggested 
that the function of viral X domains may be to bind 
poly(ADP-ribose), through which they would interact with 
host-cell pathways (Egloff et al., 2006). Catalytic adenosine 
diphosphate-ribose-1''-phosphatase (ADRP) activity has been 
shown for a number of macro domains, including the X domain 
of another member of coronavirus subgroup 1a, TGEV. The ADRP reaction 
turnover was found to be very low in vitro and this led to 
uncertainty as to the physiological relevance of this enzymatic 
activity (Egloff et al., 2006). On the other hand, this activity 
has been implicated in the rate control of a yet-to-be-identi- 
fied RNA-processing pathway in infected cells (Snijder et al., 
2003). In addition, it has been reported that MHV liver 
pathology is affected in an engineered virus mutant carrying 
an X domain with a mutation in the putative active site 
(Eriksson et al., 2008). Despite the available information, the 
function of the coronaviral X domain and its importance in the 
viral life cycle have not been fully elucidated and further 
functional studies are therefore required in order to under- 
stand its role in detail. 


4. Conclusions 


The FCoV X-domain structure has a macro-domain-like fold 
with an ADP-ribose-binding pocket. It shows high similarity 
to the structures of other members of the macro-domain 
family, especially to X domains from other coronaviruses. We 
also showed that the binding affinity of the FCoV X domain 
for ADP-ribose is noticeably lower than that of some other 
members of the family. 